Application of information retrieval techniques to single writer documents

Author

  • Alessandro Vinciarelli
Abstract

This work presents Information Retrieval experiments performed on handwritten documents produced by a single writer. The same retrieval task was performed on both manual (error-free) and automatic (Word Error Rate around 45%) transcriptions of 200 handwritten texts. The results show that the performance loss due to recognition errors is acceptable and that Information Retrieval technologies can be applied effectively to handwritten data. © 2005 Elsevier B.V. All rights reserved.
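The Word Error Rate quoted in the abstract is conventionally defined as the word-level edit distance between the automatic and the manual transcription, divided by the number of words in the manual transcription. The following is a minimal sketch of that standard definition, assuming whitespace tokenization and exact string matching; the function name `word_error_rate` is hypothetical, and the paper's own evaluation procedure may differ.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly
    (no case folding or punctuation stripping).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

Under this definition, a manual transcription `"a b c d"` against an automatic transcription `"a x c"` has one substitution and one deletion, giving a rate of 2/4 = 0.5.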

Similar resources

Writer Identification through Information Retrieval: The Allograph Weight Vector

We show a number of promising results in writer identification, by recasting the traditional information retrieval (IR) problem of finding documents based on the frequency of occurrence of their terms. In IR, the tf-idf is a well-known statistical measure that weighs the importance of certain terms occurring in a database of documents. Here, writers are searched on the basis of the frequency of...

Online Writer Identification Using Fuzzy C-means Clustering of Character Prototypes

New kinds of documents such as handwritten online documents are emerging, which are produced by digital devices such as Tablet PC, personal handheld devices or digital paper coupled with digital pens. The rapid increase in the number of such handwritten online documents leads to mounting pressure on finding innovative solutions towards faster processing, indexing and retrieval of the documents ...

Handwritten Document Analysis for Automatic Writer Recognition

In this paper, we show that both the writer identification and the writer verification tasks can be carried out using local features such as graphemes extracted from the segmentation of cursive handwriting. We thus enlarge the scope of the possible use of these two tasks which have been, up to now, mainly evaluated on script handwritings. A textual based Information Retrieval model is used for ...

Content-based Information Retrieval from Handwritten Documents

This paper is about retrieving the closest matches from a set of scanned handwritten documents based on a query that is a document image. System indexing and retrieval is based on writer characteristics, textual content, and document metadata such as writer profile. Documents are indexed using global image features, e.g., stroke width, slant, word gaps, as well as local features that descri...

Prototyping a Vibrato-Aware Query-By-Humming (QBH) Music Information Retrieval System for Mobile Communication Devices: Case of Chromatic Harmonica

Background and Aim: The current research aims at prototyping query-by-humming music information retrieval systems for smart phones. Methods: This multi-method research follows simulation technique from mixed models of the operations research methodology, and the documentary research method, simultaneously. Two chromatic harmonica albums comprised the research population. To achieve the purpose ...

Journal:
  • Pattern Recognition Letters

Volume 26

Publication date: 2005